Abstract. We show that both Teichmüller space (with the Teichmüller metric)
and the mapping class group (with a word metric) have geodesic divergence that
is intermediate between the linear rate of flat spaces and the exponential rate of
hyperbolic spaces. For every two geodesic rays in Teichmüller space, we find that
their divergence is at most quadratic. Furthermore, this estimate is shown to be 
sharp via examples of pairs of rays with exactly quadratic divergence. The same 
statements are true for geodesic rays in the mapping class group. We explicitly 
describe efficient paths "near infinity" in both spaces. 



1 Introduction 

The volume of a ball in Teichmüller space grows exponentially fast as a function of
its radius, as in the case of hyperbolic space. In this paper, we show that despite
this, the "circumference" of the ball grows only quadratically. To be precise, for
two geodesic rays $\gamma_1$ and $\gamma_2$ in a proper geodesic space $X$ with a common basepoint
$x \in X$, let their divergence be the infimal length of all paths connecting $\gamma_1(t)$ to
$\gamma_2(t)$ which maintain a distance at least $t$ from the basepoint:
\[ \operatorname{div}(\gamma_1,\gamma_2,t) = d_{X \setminus B_t(x)}\bigl(\gamma_1(t),\gamma_2(t)\bigr), \]
where $B_t(x)$ is the open ball of radius $t$ about $x$. Note that div may be infinite in
this generality.

This definition is formulated to realize the trichotomy that positive, zero, and
negative curvature correspond to sublinear, linear, and superlinear divergence of
geodesics, respectively; the correspondence is discussed further below. The main
results of this paper provide an upper bound for the divergence of geodesics in
Teichmüller space and the mapping class group for surfaces of finite type.

Throughout this article, we compare non-negative functions $f(t)$ and $g(t)$ using
the symbols $\overset{+}{\asymp}$, $\overset{*}{\asymp}$, $\asymp$, $\overset{+}{\prec}$, $\overset{*}{\prec}$, $\prec$ (and likewise $\succ$) to denote equality or
inequality with respect to an additive constant, a multiplicative constant, or both,
respectively, where the constants depend on the topology of $S$ only. For example,
$f(t) \asymp g(t)$ means that there are constants $c_1$ and $c_2$ depending only on the
topology of $S$ such that
\[ \frac{1}{c_1} f(t) - c_2 \le g(t) \le c_1 f(t) + c_2 \quad \forall t. \]

Figure 1: A path connecting $\gamma_1(t)$ to $\gamma_2(t)$ that stays outside the open ball of
radius $t$ centered at the basepoint.

We may refer to this as the order of a function; for instance if $f(t) \asymp t^2$ we may say
that $f$ is on the order of $t^2$. Note that sometimes the distinction between $\overset{+}{\prec}$ and $\prec$
is crucial, as in many applications of Theorem 2.3 below. Note also that although
much of the exposition is streamlined by this notation, it is sometimes necessary to
pay attention to the constants, as when considering functions $f(t) \asymp t^2$ and $g(t) \asymp t^2$
and trying to prove that $f(t) - g(t) \asymp t^2$.

With this equivalence relation, any two linear functions (respectively, polynomials
of degree $n$) are identified. Having $f \prec 1$ means the function is bounded above.
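As a small illustration of this notation, the following sketch works through one equivalence and one failure of equivalence. The functions and the constants $c_1 = 3$, $c_2 = 5$ are chosen only for this example; in the paper the constants are required to depend only on the topology of $S$.

```latex
% Illustrative example only: f, g, c_1, c_2 are hypothetical choices.
Take $f(t) = 3t + 5$, $g(t) = t$, $c_1 = 3$ and $c_2 = 5$. Then for all $t \ge 0$
\[
  \tfrac{1}{c_1} f(t) - c_2 \;=\; t - \tfrac{10}{3} \;\le\; t \;=\; g(t)
  \;\le\; 9t + 20 \;=\; c_1 f(t) + c_2 ,
\]
so $f \asymp g$: the two linear functions are identified.

% Why constants can matter: equivalence is not preserved by subtraction.
Now take $f(t) = t^2 + t$ and $g(t) = t^2$. Both satisfy $f \asymp t^2$ and
$g \asymp t^2$, but
\[
  f(t) - g(t) = t \not\asymp t^2 ,
\]
since no constants give $t^2 \le c_1 t + c_2$ for all $t$.
```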

Theorem A. Let $S$ be a surface of genus $g$ with $p$ punctures, such that $3g + p > 4$.
Let $X$ be either the Teichmüller space $\mathcal{T}(S)$ with the Teichmüller metric or the
mapping class group $\mathrm{Mod}(S)$ with a word metric from a finite generating set. For
any pair of geodesic rays $\gamma_1(t), \gamma_2(t)$ with a common basepoint $x \in X$,

\[ \operatorname{div}(\gamma_1,\gamma_2,t) \prec t^2. \]

To obtain the quadratic upper bound, we explicitly construct paths that
travel through chains of product regions. In the Teichmüller case, this path travels
through the thin part of $\mathcal{T}(S)$, which is stratified into regions which have a product
structure, up to additive distortion. Estimating the length of the path uses a
combinatorial formula for distance in Teichmüller space (Theorem 2.4). In the case
of the mapping class group, we use the quasi-isometrically embedded copies of $\mathbb{Z}^2$
generated by pairs of Dehn twists about disjoint curves.

This theorem does not provide a quadratic or even linear lower bound for all
divergence rates in either space $X$, and indeed none is possible since in both cases
there are non-diverging pairs that pass every threshold separation. That is, based
at every point in Teichmüller space or the mapping class group, and for arbitrarily
large $M > 0$, there are pairs of rays with $\limsup \operatorname{div}(\gamma_1,\gamma_2,t) = M$. One constructs
these examples in Teichmüller space from pairs of quadratic differentials with the
same underlying topology (see [M1]); in the mapping class group, one uses the many
undistorted copies of $\mathbb{Z}^2$.

On the other hand, the upper bound is sharp. 
Figure 2: An efficient path chains through product regions in the space.



Theorem B. For each $X$ as above, there exist pairs of rays $\gamma_1, \gamma_2$ at every basepoint
such that $\operatorname{div}(\gamma_1,\gamma_2,t) \asymp t^2$.

Examples realizing the quadratic rate are obtained from axes of pseudo-Anosov
mapping classes. The lower bound on the divergence is furnished by means of
quasi-projection theorems to these axes; in Teichmüller space, this is an immediate
application of a result of Minsky, while in the mapping class group an analogous
result is obtained using hyperbolicity of the curve complex. (See [B] for a different
treatment of divergence in the case of the mapping class group.)

We remark that when $3g + p = 4$ (that is, when $S$ is either a once-punctured torus
or a four-times-punctured sphere), the Teichmüller metric on $\mathcal{T}(S)$ has negative
curvature and the divergence is exponential. In higher complexity, Teichmüller space
is not hyperbolic ([MW], [Mi1]). However, there is a long-standing analogy between
the geometry of Teichmüller space and that of a hyperbolic space ([Be], [Ke], [M2],
[W1], [Mi2]). This paper provides another point of view from which Teichmüller
space is quite different from a hyperbolic space.

These results should also be regarded as showing that $\mathcal{T}(S)$ and $\mathrm{Mod}(S)$ have
intermediate divergence in the following sense. A proper geodesic space $X$ is said to
have exponential divergence if there exists a threshold value $D > 0$ such that for all
rays $\gamma_1, \gamma_2$,

\[ \exists\, t_0 \text{ such that } d\bigl(\gamma_1(t_0),\gamma_2(t_0)\bigr) > D \;\Longrightarrow\; \operatorname{div}(\gamma_1,\gamma_2,t) \succ e^{t}. \]

Gromov hyperbolicity ($\delta$-hyperbolicity) is then equivalent to exponential divergence
($D = 3\delta$ will suffice; see [BrH]). On the other hand, in flat spaces, every two geodesic
rays have a linear divergence function.
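The two extreme rates can be checked by hand in the model planes. The computation below is a standard one and is not taken from the paper; the angle $\theta$ between the rays is a symbol introduced only for this sketch.

```latex
% Standard model computations, included as an illustration only.
% In both planes the shortest path outside the open ball B_t(x) joining
% \gamma_1(t) to \gamma_2(t) is the circular arc of radius t, because
% nearest-point projection onto the closed ball does not increase length.

% Euclidean plane: rays from x meeting at angle \theta \in (0, \pi].
\[
  \operatorname{div}(\gamma_1,\gamma_2,t) = \theta\, t
  \qquad \text{(linear in } t\text{)} .
\]

% Hyperbolic plane of curvature -1: a circle of radius t has
% circumference 2\pi \sinh t, so the arc subtending angle \theta has length
\[
  \operatorname{div}(\gamma_1,\gamma_2,t) = \theta \sinh t
  = \tfrac{\theta}{2}\bigl(e^{t} - e^{-t}\bigr)
  \qquad \text{(exponential in } t\text{)} .
\]
```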

A geodesic space can be said to have intermediate divergence if no rays diverge 
faster than f{t), and some diverge at the rate f{t), for a function growing super- 
linearly but subexponentially. 
In [G1,2], Gersten attributes to Gromov the expectation that there should be
no nonpositively curved spaces of intermediate divergence. However, Gersten
constructed such an example, giving a finite CAT(0) 2-complex whose universal cover
possesses two geodesic rays which diverge quadratically and such that no pair of
rays diverges faster than quadratically. Furthermore, in those papers, Gersten
introduced a variation on divergence which appeals directly to the metric with no
reference to geodesics. (He measures the lengths of paths staying far from a basepoint
$x$ between pairs of points at a fixed distance from $x$, with a slightly weaker
equivalence relation on rates of growth than the one given here.) The advantage of
this approach is that it produces a quasi-isometry invariant, so that the divergence
of a finitely generated group can be discussed without specifying a generating set.
Gersten and Kapovich-Leeb ([G2], [KL]) study divergence in 3-manifold groups and
find that some (namely graph-manifold groups) have quadratic divergence in the
same sense as above. Here, we show that the same behavior occurs in the spaces
$\mathcal{T}(S)$ and $\mathrm{Mod}(S)$, which have been extensively studied in their own right. Note
that our Theorem A implies quadratic divergence in Gersten's coarse sense, for the
word metric with respect to any finite generating set.

By contrast, symmetric spaces have a gap in the possible orders of $\operatorname{div}(\cdot,\cdot,t)$
between linear and exponential rates; there, quadratic divergence never occurs for
any pair of rays [Gr]. Thus this paper also highlights the limitations of the
long-standing analogies between $\mathcal{T}(S)$ and symmetric spaces.

Acknowledgments. The authors would like to thank the Fields Institute for host- 
ing the workshop at which these results were first conceived. We also thank Howard 
Masur, Yair Minsky, Saul Schleimer, and Dylan Thurston for very helpful conversations
during the conference. In particular, Saul Schleimer observed that the upper-bound
arguments for Teichmüller space would essentially carry over to the mapping
class group. Thanks also to Misha Kapovich, Rohit Thomas, and an anonymous
referee for helpful comments on the paper. 

2 Background 

Let S be an orientable, connected topological surface of genus g with p punctures. 
Throughout this paper, we assume that 3g + p > 4. In this section, we review some 
background material and establish a few lemmas. 

The space of curves and arcs. Let $Y$ be an essential subsurface of $S$ (if it is a
proper subsurface, then it has boundary, and we write $\partial Y$ for that set of curves). By
a curve in $Y$ we mean a non-trivial, non-peripheral, simple closed curve in $Y$ and by
an essential arc we mean a simple arc, with endpoints on the boundary of $Y$, that
cannot be pushed to the boundary of $Y$. In case $Y$ is not an annulus, the homotopy
class of an arc is considered relative to the boundary of $Y$; when $Y$ is an annulus,
the homotopy class of an arc is considered relative to the endpoints of the arc. We
will use $D_\alpha$ to denote the Dehn twist about a simple closed curve $\alpha$.
Let $\mathcal{C}(Y)$ be the set of all homotopy classes of curves and essential arcs on the
surface $Y$. We define a distance on $\mathcal{C}(Y)$ as follows: for $\alpha, \beta \in \mathcal{C}(Y)$, define $d_Y(\alpha,\beta)$
to be equal to one if $\alpha \ne \beta$ and $\alpha$ and $\beta$ can be represented by disjoint curves or
arcs. The metric on $\mathcal{C}(Y)$ is the maximal metric having the above property. Thus,
$d_Y(\alpha,\beta) = n$ if $\alpha = \gamma_0, \gamma_1, \ldots, \gamma_n = \beta$ is some shortest sequence of curves or arcs
on $S$ such that successive $\gamma_i$ are disjoint. Note that the notation $\mathcal{C}(Y)$ is often used
for the complex of curves on $Y$ (see [MM1] for definitions). Instead, we use $\mathcal{C}(Y)$ to
denote the zero-skeleton of the complex of curves and arcs, with distance induced
by the one-skeleton.

In [MM1], Masur and Minsky show that $\mathcal{C}(S)$ with the above metric is
$\delta$-hyperbolic in the sense of Gromov, and consequences of this are discussed further
below. The curve complex is well known (and often proved) to be connected. The
isometry group of $\mathcal{C}(S)$ is the mapping class group $\mathrm{Mod}(S)$ of $S$, as shown by Ivanov,
Korkmaz, and Luo.

Subsurface projections. For $\alpha$ a curve in $S$, we define the subsurface projection
of $\alpha$ to the essential subsurface $Y$ as follows: Let

\[ f \colon \bar S \to S \]

be a regular covering of $S$ such that $f_*(\pi_1(\bar S))$ is conjugate to $\pi_1(Y)$ (this is called the
$Y$-cover of $S$). Since $S$ admits a hyperbolic metric, $\bar S$ has a well-defined boundary at
infinity, and we use the same notation $\bar S$ to denote the cover with its boundary added
when appropriate. Let $\bar\alpha$ be the lift of $\alpha$ to $\bar S$. Components of $\bar\alpha$ that are essential
arcs or curves on $\bar S$, if any, form a subset of $\mathcal{C}(\bar S)$. The surface $\bar S$ is homeomorphic
to $Y$. We call the corresponding subset of $\mathcal{C}(Y)$ the subsurface projection of $\alpha$ to
$Y$ and will denote it by $\alpha_Y$. If there are no essential arcs or curves in $\bar\alpha$, then $\alpha_Y$
is the empty set; otherwise we say that $\alpha$ intersects $Y$ essentially. This projection
only depends on $\alpha$ up to homotopy.

Let $\alpha$ and $\alpha'$ be curves in $S$ that intersect a subsurface $Y$ essentially. We define
the projection distance between $\alpha$ and $\alpha'$ to be the maximum distance in $\mathcal{C}(Y)$
between the elements of the projections $\alpha_Y$ and $\alpha'_Y$, and denote it by $d_Y(\alpha,\alpha')$. If
$Y$ is an annulus whose core is the curve $\gamma$, then we may denote $\mathcal{C}(Y)$ by $\mathcal{C}(\gamma)$ and
$d_Y(\alpha,\alpha')$ by $d_\gamma(\alpha,\alpha')$.

Many of the results in the Masur-Minsky papers on the geometry of the curve
complex proceed by analyzing the structure of $\mathcal{C}(S)$ in terms of subsurfaces $Y$ by
means of subsurface projection. For instance, here is a useful result from [MM1].

Theorem 2.1 (Theorem 3.1 in [MM1]). There is a constant $M_0$ such that if the
projection distance to some subsurface $Y$ satisfies

\[ d_Y(\alpha,\beta) \ge M_0, \]

then any geodesic in $\mathcal{C}(S)$ connecting $\alpha$ to $\beta$ intersects the 1-neighborhood of $\partial Y$.

Markings. Following Thurston, a pants decomposition on $S$ is a maximal collection
of disjoint curves; these $3g - 3 + p$ curves are called the pants curves of that
decomposition. The Fenchel-Nielsen coordinates on $\mathcal{T}(S)$ are obtained by assigning
a length and a twist coordinate to each curve in a pants decomposition. Alternatively,
a second set of curves may be chosen, transverse to the first, so that lengths
of all $6g - 6 + 2p$ curves give approximate coordinates on $\mathcal{T}(S)$. Along these lines,
we define a marking on $S$ to be a set of pairs of curves $\mu = \{(\alpha_1,\beta_1), \ldots, (\alpha_m,\beta_m)\}$
such that $\{\alpha_1, \ldots, \alpha_m\}$ is a pants decomposition of $S$, the curve $\beta_i$ is disjoint from
$\alpha_j$ when $i \ne j$, and intersects $\alpha_i$ once (respectively, twice) if the surface filled by
$\alpha_i$ and $\beta_i$ is a once-punctured torus (respectively, a four-times-punctured sphere).
The pants curves $\alpha_i$ are called the base curves of the marking $\mu$. For every $i$, the
corresponding $\beta_i$ is called the transverse curve to $\alpha_i$ in $\mu$. When the distinction
between the base curves and the transverse curves is not important, we represent
a marking as simply a set of $2m$ curves. Denote the space of all markings on $S$ by
$\mathcal{M}(S)$.

Still following [MM2], there are two types of elementary moves in $\mathcal{M}(S)$.

1. Twist: Replace $\beta_i$ by $\beta_i'$, where $\beta_i'$ is obtained from $\beta_i$ by a Dehn twist or a
half twist around $\alpha_i$.

2. Flip: Replace the pair $(\alpha_i,\beta_i)$ with $(\beta_i,\alpha_i)$; also "clean up" by, for $j \ne i$,
replacing $\beta_j$ with a curve $\beta_j'$ that does not intersect $\beta_i$ (the new base curve)
in such a way that $d_{\alpha_j}(\beta_j,\beta_j')$ is as small as possible (see [MM2] for details).

In the first move, a twist can be positive or negative. A half twist is allowed when
$\alpha_i$ and $\beta_i$ intersect twice. Masur-Minsky analyzed the geometry of the curve
complex by finding efficient paths of markings through elementary moves. The following
theorem is a version of their result in which the high powers of Dehn twists
are rearranged to appear consecutively; this adaptation facilitates computations of
Teichmüller distance through changes of marking.

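Theorem 2.2 [R1]. There exists a constant $K$ (depending on $S$) such that for any
two markings $\eta_1, \eta_2$ on $S$, there is a path of markings

\[ \eta_1 = \mu_1, \ldots, \mu_n = \eta_2, \]
where $\mu_i$ and $\mu_{i+1}$ differ by an elementary move except that, for each $\alpha$ with
$d_\alpha(\eta_1,\eta_2) \ge K$, there is a unique index $i_\alpha$ such that

\[ \mu_{i_\alpha+1} = D_\alpha^{p}\,\mu_{i_\alpha} \quad \text{with} \quad |p| \overset{+}{\asymp} d_\alpha(\eta_1,\eta_2). \]

This path is efficient:

\[ n \asymp \sum_{Y \subseteq S} \bigl[d_Y(\eta_1,\eta_2)\bigr]_K, \]

where the sum is over non-annulus subsurfaces $Y$ and $[N]_K := N$ if $N \ge K$, while
$[N]_K := 0$ if $N < K$.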

Product regions in Teichmüller space. Let $\mathcal{T}(S)$ denote the Teichmüller space
of $S$ equipped with the Teichmüller metric. A point $\tau \in \mathcal{T}(S)$ is a hyperbolic metric
on $S$ (constant curvature $-1$). Minsky has shown that the thin part of Teichmüller
space has a product-like structure, as we now describe. Let $\Gamma$ be a set of
disjoint curves on $S$ and let $\mathrm{Thin}_\epsilon(\Gamma)$ denote the set of points in Teichmüller space
such that all curves from $\Gamma$ are short in hyperbolic length:

\[ \mathrm{Thin}_\epsilon(\Gamma) := \{\tau \in \mathcal{T}(S) \mid l_\tau(\gamma) \le \epsilon \text{ for all } \gamma \in \Gamma\}. \]
These cover the $\epsilon$-thin part of Teichmüller space, which consists of those metrics
with some short curve:

\[ \mathrm{Thin}_\epsilon := \{\tau \in \mathcal{T}(S) \mid \operatorname{injrad}(\tau) \le \epsilon\}. \]
The $\epsilon$-thick part is the complement of $\mathrm{Thin}_\epsilon$. Let $\mathrm{Prod}(\Gamma)$ denote the product space,

\[ \mathrm{Prod}(\Gamma) := \mathcal{T}(S \setminus \Gamma) \times \prod_{\gamma \in \Gamma} \mathbb{H}_\gamma, \]

where $S \setminus \Gamma$ is considered as a surface of lower complexity and each $\mathbb{H}_\gamma$ is a horoball
in the copy of the hyperbolic plane parametrizing the Fenchel-Nielsen coordinates
corresponding to a short curve $\gamma$ (the $x$-coordinate in $\mathbb{H}_\gamma$ represents the twist
parameter along $\gamma$ in $\sigma$ and the $y$-coordinate represents the reciprocal of the length
of $\gamma$ in $\sigma$; see [Mi1]). Endow $\mathrm{Prod}(\Gamma)$ with the sup metric. Minsky has shown, for
small enough $\epsilon$, that $\mathrm{Thin}_\epsilon(\Gamma)$ is well-approximated by $\mathrm{Prod}(\Gamma)$.

Theorem 2.3 (Product regions [Mi1]). The Fenchel-Nielsen coordinates on $\mathcal{T}(S)$
give rise to a natural homeomorphism $\pi \colon \mathcal{T}(S) \to \mathrm{Prod}(\Gamma)$. There exists an $\epsilon_0 > 0$
sufficiently small that this homeomorphism restricted to $\mathrm{Thin}_{\epsilon_0}(\Gamma)$ distorts distances
by a bounded additive amount.

Note that $\mathcal{T}(S \setminus \Gamma) = \prod_Y \mathcal{T}(Y)$, where the product is over all connected
components $Y$ of $S \setminus \Gamma$. Let $\pi_0$ denote the coordinate factor of $\pi$ mapping to $\mathcal{T}(S \setminus \Gamma)$,
let $\pi_Y$ denote the coordinate factor mapping to $\mathcal{T}(Y)$, and, for $\gamma \in \Gamma$, let $\pi_\gamma$ denote
the coordinate factor mapping to $\mathbb{H}_\gamma$.

Short markings and Teichmüller distance. Let $\sigma$ be a point in the Teichmüller
space $\mathcal{T}(S)$ of $S$. A short marking on $\sigma$ is a marking whose curves are chosen
greedily to be as short as possible with respect to hyperbolic length. That is, let
$\alpha_1$ be a simple closed curve of minimal length in $\sigma$, $\alpha_2$ a shortest curve disjoint
from $\alpha_1$, and so on, to form a pants decomposition of $S$ (the Bers constant gives
an upper bound on the lengths of curves in a greedily chosen pants decomposition).
Then, let the transverse curve $\beta_i$ be the shortest curve intersecting $\alpha_i$ and disjoint
from $\alpha_j$, $i \ne j$. There are only finitely many choices in this process. The following
distance formula relates the Teichmüller distance between two points $\sigma_1$ and $\sigma_2$ to
the combinatorics of short markings in $\sigma_1$ and $\sigma_2$. Let $\epsilon_0$ be as before. Define $\Gamma$
to be the set of curves that are $\epsilon_0$-short in both $\sigma_1$ and $\sigma_2$, and, for $i = 1,2$, define
$\Gamma_i$ to be the set of curves that are $\epsilon_0$-short in $\sigma_i$ but not in $\sigma_{3-i}$. Let $\eta_1$ and $\eta_2$ be
short markings on $\sigma_1$ and $\sigma_2$, respectively.

Theorem 2.4 [R1]. For sufficiently large $K$, the distance in $\mathcal{T}(S)$ between $\sigma_1$ and
$\sigma_2$ is given by the following formula:

\begin{multline}
d_{\mathcal{T}}(\sigma_1,\sigma_2) \asymp \sum_Y \bigl[d_Y(\eta_1,\eta_2)\bigr]_K + \sum_{\alpha \notin \Gamma} \log \bigl[d_\alpha(\eta_1,\eta_2)\bigr]_K \\
+ \max_{\alpha \in \Gamma} d_{\mathbb{H}_\alpha}(\sigma_1,\sigma_2) + \max_{\substack{\alpha \in \Gamma_i \\ i = 1,2}} \log \frac{1}{l_{\sigma_i}(\alpha)}, \tag{1}
\end{multline}

where the threshold function $[\,\cdot\,]_K$ is as before.
For the rest of the paper, we fix $\epsilon_0$ so that Minsky's product regions theorem
and Theorem 2.4 hold.



Subsurface distances and word length in the mapping class group. Consider
the mapping class group $\mathrm{Mod}(S)$ of the surface $S$. We fix a finite set $A$ of
curves in $S$ such that the set $\{D_\alpha : \alpha \in A\}$ of Dehn twists around curves in $A$
generates $\mathrm{Mod}(S)$. (It follows that the curves in $A$ fill $S$.) Equip $\mathrm{Mod}(S)$ with the
word metric corresponding to this generating set and denote the word length of an
element $h \in \mathrm{Mod}(S)$ by $|h|$.

Definition 2.5 (Subsurface distance). Let $Y$ be a subsurface of $S$ and $\mu_1, \mu_2$ be
two sets of curves on $S$. We define

\[ d_Y(\mu_1,\mu_2) = \max_{\alpha_1 \in \mu_1,\ \alpha_2 \in \mu_2} d_Y(\alpha_1,\alpha_2), \]
where $d_Y(\alpha_1,\alpha_2)$ is as defined above. For $h \in \mathrm{Mod}(S)$, let

\[ d_Y(h) = d_Y(A, h(A)), \]

$A$ as above. When $Y$ is an annulus whose core curve is $\gamma$, we may write $d_\gamma(h)$
instead of $d_Y(h)$.

Note that since A fills S, h{A) fills S as well for every h; therefore, h{A) intersects 
every subsurface essentially and the projection of h{A) to Y is always non-empty. 

Next, we need the following theorem of Masur and Minsky which relates the word 
length of a mapping class h to the subsurface distances corresponding to h. The sum 
in the statement of the theorem is broken into two parts (summing over annular and 
non-annular subsurfaces) to highlight the comparison between this case and that in
Theorem 2.4. The theorem essentially says that word length is comparable to the
sum of the very large subsurface projections.

Theorem 2.6 (Word length in the mapping class group [MM2]). There is a
constant $K_0$ such that for every threshold $K \ge K_0$ there exists $c$ such that for every
$h \in \mathrm{Mod}(S)$,

\[ \frac{|h|}{c} \le \sum_\alpha \bigl[d_\alpha(h)\bigr]_K + \sum_Y \bigl[d_Y(h)\bigr]_K \le c\,|h|, \]

where the first sum is over all curves on $S$, the second sum is over all non-annular
subsurfaces of $S$, and

\[ [N]_K := \begin{cases} N, & N \ge K, \\ 0, & N < K. \end{cases} \]

We will sometimes refer to such a constant $c$ as the word-length constant (for a
threshold $K$), taking $c_0$ to be the value corresponding to $K_0$.
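To see how the threshold function filters such a sum, here is a small worked example. The projection values and the threshold $K = 5$ are hypothetical, chosen only for illustration; they are not computed from any particular mapping class.

```latex
% Hypothetical values; K = 5 is an arbitrary illustrative threshold.
\[
  [N]_K = \begin{cases} N, & N \ge K, \\ 0, & N < K, \end{cases}
  \qquad
  [2]_5 = 0, \quad [4]_5 = 0, \quad [7]_5 = 7, \quad [30]_5 = 30 .
\]
% If these were the only nonzero projection distances of some h,
% only the two large projections would contribute to the sum:
\[
  [2]_5 + [4]_5 + [7]_5 + [30]_5 = 0 + 0 + 7 + 30 = 37 .
\]
```

Small projections are discarded entirely rather than rounded, which is why only the very large subsurface projections matter for the comparison with word length.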

We also need the following few simple lemmas. 
Lemma 2.7. For $h_1, h_2, h_3 \in \mathrm{Mod}(S)$ and any subsurface $Y$ of $S$, we have the
following triangle inequality.

\[ d_Y(h_1(A),h_3(A)) \overset{+}{\prec} d_Y(h_1(A),h_2(A)) + d_Y(h_2(A),h_3(A)). \]

Proof. The set $A$ has bounded diameter in $\mathcal{C}(S)$, therefore its projection to any
subsurface $Y$ also has bounded diameter. The same is true for $h_i(A)$, $i = 1,2,3$.
But $d_Y$ satisfies the triangle inequality in $\mathcal{C}(Y)$. Therefore, the above inequality is
also true with the additive error of at most $3\operatorname{diam}(A)$ in $\mathcal{C}(S)$. □



GAFA 



DIVERGENCE OF GEODESICS 






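One may think of $d_\alpha$ as measuring the relative twisting between two curves
around $\alpha$.

Lemma 2.8. For any curve $\alpha$ in $S$ and any mapping class $h \in \mathrm{Mod}(S)$,

\[ d_\alpha(D_\alpha^n h) \overset{+}{\succ} n - c\,|h|, \]
where $c$ is the word-length constant.

Proof. Using Lemma 2.7 we have

\begin{align*}
d_\alpha(D_\alpha^n h) = d_\alpha(A, D_\alpha^n h(A)) &\overset{+}{\succ} d_\alpha(h(A), D_\alpha^n h(A)) - d_\alpha(A, h(A)) \\
&\overset{+}{\succ} n - d_\alpha(h).
\end{align*}

Here, the last estimate is valid because $h(A)$ fills, so it contains a curve $\beta$ such that
$d_\alpha(\beta, D_\alpha^n \beta) \overset{+}{\asymp} n$. Finally, Theorem 2.6 implies that $d_\alpha(h) \prec c\,|h|$ and the lemma
follows. □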

Lemma 2.9. If $\alpha$ and $\beta$ are disjoint, then $d_\alpha(D_\beta^n)$ has an upper bound that is
independent of $n$.

Proof. Let $\bar S$ be the annular cover of $S$ with respect to $\alpha$, let $\bar\alpha$ be the lift of $\alpha$ that
is a closed curve and $\bar A$ be the set of lifts of the elements of $A$ which intersect $\bar\alpha$.
Let $\gamma$ be a curve in $S$ intersecting $\alpha$ but not $\beta$ such that $d_\alpha(A,\gamma) \asymp 1$ (the last
condition can be obtained after applying an appropriate power of $D_\alpha$ to $\gamma$). Let $\bar\gamma$
be a lift of $\gamma$ to $\bar S$ that intersects $\bar\alpha$.

Since $S$ is hyperbolic, $\bar S$ has a well-defined boundary at infinity. The lifts of $\beta$ are
arcs with endpoints on the boundary; a Dehn twist around $\beta$ in $S$ lifts to shearing
along these arcs in $\bar S$. Since $\bar\gamma$ is disjoint from all lifts of $\beta$, no amount of shearing
of $\bar A$ along lifts of $\beta$ can change the intersection number of $\bar A$ with $\bar\gamma$. That is,

\[ d_\alpha(\gamma, A) = d_\alpha(\gamma, D_\beta^n A) \asymp 1, \]

\[ d_\alpha(A, D_\beta^n A) \le d_\alpha(\gamma, A) + d_\alpha(\gamma, D_\beta^n A) \asymp 1. \quad □ \]

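Lemma 2.10. For all words $h \in \mathrm{Mod}(S)$, all integers $n, m$, and all curves $\alpha$ and $\beta$
such that $i(\alpha,\beta) = 0$, we have

\[ |D_\alpha^n D_\beta^m h| \succ \frac{\max(|n|,|m|)}{c} - |h| \]

for the word-length constant $c$.

Proof. Without loss of generality, $n \ge m \ge 0$. We have

\begin{align*}
|D_\alpha^n D_\beta^m h| &\succ d_\alpha(D_\alpha^n D_\beta^m h)/c && \text{(Theorem 2.6)} \\
&= d_\alpha(D_\beta^m D_\alpha^n h)/c && \text{($D_\alpha$ and $D_\beta$ commute)} \\
&\overset{+}{\succ} d_\alpha(D_\alpha^n h)/c && \text{(Lemma 2.9)} \\
&\overset{+}{\succ} n/c - |h|. && \text{(Lemma 2.8)}
\end{align*}

This completes the proof. □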
Below, we will need to construct paths outside the ball $B_t(e)$ in $\mathrm{Mod}(S)$. To
initiate, it will be necessary to push away from the ball; the following lemma says
that for any Dehn twist, a power of either it or its inverse accomplishes this. Let

\[ \phi \colon \mathbb{N} \to \mathrm{Mod}(S) \]

be a geodesic ray in $\mathrm{Mod}(S)$ based at the identity.

Lemma 2.11 (Pushing off). There is a constant $d \in \mathbb{N}$ such that, for any curve
$\alpha \in A$ and any nonnegative integers $n, m, t \in \mathbb{N}$, the magnitudes $|D_\alpha^{n}\phi(dt)|$ and
$|D_\alpha^{-m}\phi(dt)|$ are not both less than or equal to $t$.

Proof. Let $c$ be the word-length constant from Theorem 2.6 and set $d > c + 2$. Let
$h = \phi(dt)$ and suppose, for contradiction, that

\[ |D_\alpha^{n} h| \le t \quad \text{and} \quad |D_\alpha^{-m} h| \le t \]
for some $m, n$. Then $|D_\alpha^{n+m}| \le 2t$. But $d_\alpha(D_\alpha^{n+m}) \ge m + n - 1$. Using the
word-length constant from Theorem 2.6 we obtain

\[ m + n \le 2ct + 1. \tag{3} \]
Since $D_\alpha$ is a generator for $\mathrm{Mod}(S)$, $|D_\alpha^{n}| \le n$. Thus

\[ t \ge |D_\alpha^{n} h| \ge |h| - |D_\alpha^{n}| \ge dt - n. \]
That is, $n \ge (d-1)t$. Similarly, $m \ge (d-1)t$ and

\[ m + n \ge 2(d-1)t > 2(c+1)t = 2ct + 2t > 2ct + 1, \]
which is a contradiction. □
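The contradiction in this proof can be seen concretely by plugging in numbers. The values $c = 2$, $d = 5$ and $t = 10$ below are hypothetical, chosen only so that $d > c + 2$; they are not constants from the paper.

```latex
% Hypothetical numerical instance of the pushing-off argument.
% Assumed values: c = 2 (word-length constant), d = 5 > c + 2, t = 10.
Suppose both $|D_\alpha^{n} h| \le 10$ and $|D_\alpha^{-m} h| \le 10$, where
$|h| = dt = 50$. Inequality (3) gives
\[
  m + n \le 2ct + 1 = 2 \cdot 2 \cdot 10 + 1 = 41 .
\]
On the other hand, $10 \ge |D_\alpha^{n} h| \ge |h| - n = 50 - n$ forces
\[
  n \ge (d-1)t = 40, \qquad \text{and likewise} \qquad m \ge 40 ,
\]
so $m + n \ge 80 > 41$, which is impossible.
```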

Consequences of hyperbolicity. Recall that a space is called $\delta$-hyperbolic
for some $\delta > 0$ if every geodesic triangle is $\delta$-thin: each side is contained in a
$\delta$-neighborhood of the union of the other two sides. Also, if a geodesic is regarded
as an isometric embedding from (a subinterval of) $\mathbb{R}$ into $X$, then a
$Q$-quasi-geodesic replaces the equality with a coarse equality with additive and
multiplicative constant $Q$. We collect here some standard consequences of hyperbolicity
for use later in the paper. (See [BrH] for a reference on hyperbolic spaces and
quasi-geodesics.)

For the following three lemmas, let $X$ be a $\delta$-hyperbolic space and let $L \subset X$ be
a $Q$-quasi-geodesic line, ray, or segment. Suppose $a, b \in X$ and $\bar a, \bar b \in L$ are such
that $d(a,\bar a)$ and $d(b,\bar b)$ realize the distance from $a$ and $b$, respectively, to $L$.

Lemma 2.12 (Thin quadrilaterals). There exist constants $M_1 = M_1(\delta,Q)$ and
$M_2 = M_2(\delta,Q)$ such that if $d(\bar a,\bar b) > M_1$, then any geodesic from $a$ to $b$ intersects
the $M_2$-neighborhood of $L$.

Lemma 2.13 (Bounded shadows). There exists a constant $M_3 = M_3(\delta,Q)$ such
that for any geodesic segment $I$ from $a$ to $\bar a$, the closest-point projection of $I$ to $L$
has diameter $\le M_3$.

Lemma 2.14 (Bounded projection). There exists a constant $M_4 = M_4(\delta,Q)$ such
that $d(\bar a,\bar b) \le M_4 \cdot d(a,b) + M_4$.

Below, we will write $\mathrm{Proj}_L$ for the closest-point projection to a quasi-geodesic $L$.
Since $\mathrm{Proj}_L(a)$ is of bounded diameter, this is a coarsely well-defined map.

Recalling from above the Masur-Minsky result that $\mathcal{C}(S)$ is $\delta$-hyperbolic, we will
reserve the notation $\delta$ for this particular hyperbolicity constant.
3 Divergence in Teichmüller Space

As before, let $\mathcal{T}(S)$ denote the Teichmüller space of $S$ equipped with the Teichmüller
metric. For a quadratic differential $q$ on a Riemann surface of topological
type $S$, let $[q]$ be the corresponding point of $\mathcal{T}(S)$. (That is, $[q]$ is the hyperbolic
metric in the conformal class of $q$; for more background on Teichmüller space and
quadratic differentials, see for instance [St], [5], or [H].) Take $\sigma \in \mathcal{T}(S)$ and two
quadratic differentials $q_1$ and $q_2$ such that $[q_1] = [q_2] = \sigma$. Let $q_1(t)$ and $q_2(t)$ be the
images of $q_1$ and $q_2$, respectively, under the time-$t$ Teichmüller geodesic flow. The
maps

\[ t \mapsto [q_i(t)], \quad i = 1,2, \]
from $[0,\infty)$ to $\mathcal{T}(S)$ are geodesic rays in $\mathcal{T}(S)$ emanating from $\sigma$. We want to show
that for all $\sigma, q_1, q_2$ as above and $t > 0$, there is a path from $[q_1(t)]$ to $[q_2(t)]$ in $\mathcal{T}(S)$
with length on the order of $t^2$ that stays outside $B_t(\sigma)$, the open ball of radius $t$ in
$\mathcal{T}(S)$ centered at $\sigma$ (see Fig. 1).

We will repeatedly use the same argument to show that a point in $\mathcal{T}(S)$ maintains
a distance at least $t$ from $\sigma$: we show that the value $\epsilon(t) = e^{-3t}$ is small enough that
any point $\tau \in \mathcal{T}(S)$ with an $\epsilon$-short curve satisfies $d_{\mathcal{T}}(\tau,\sigma) \ge t$. Then we carry
out the appropriate sequence of elementary moves while maintaining some $\epsilon$-short
curve at all times.

Constructing a path for the upper bound. To begin the progress from one
ray to the other, we push off from the ball $B_t(\sigma)$ of radius $t$ around $\sigma$ so that the
subsequent moves are guaranteed to stay far from $\sigma$. It suffices to construct, for
sufficiently large $t$, a path between $\sigma_1 = [q_1(3t)]$ and $\sigma_2 = [q_2(3t)]$ whose length is
of order $t^2$, while controlling the distance from $\sigma$. The path will follow a sequence
of elementary moves, maintaining a sufficiently short curve at all times in order to
ensure that we stay outside $B_t(\sigma)$.

Fix $\epsilon = e^{-3t}$ and suppose $\alpha$ on $S$ and $\tau \in \mathcal{T}(S)$ satisfy $l_\tau(\alpha) \le \epsilon$. In [W2], it is
shown that for a $K$-quasi-conformal map between Riemann surfaces, the hyperbolic
lengths of curves are changed by at most a multiplicative factor of $K$. Consequently
the Teichmüller distance is bounded below by the ratio of hyperbolic lengths for any
particular curve:

\[ d_{\mathcal{T}}(\sigma,\tau) \ge \frac{1}{2}\log\frac{l_\sigma(\alpha)}{l_\tau(\alpha)} \ge \frac{1}{2}\log\frac{\epsilon_1}{\epsilon} \ge t. \]

That is, this value of $\epsilon$ has the property described above that if any curve on a
surface $\tau$ is $\epsilon$-short, then $\tau \notin B_t(\sigma)$.

We first give the argument under the assumption that the path starts and ends
in the $\epsilon_0$-thick part (that is, $\sigma_1, \sigma_2 \notin \mathrm{Thin}_{\epsilon_0}$); we will treat the general case last.
Recall that $\epsilon_0$ is chosen as in the product regions theorem.

Let $\mu_1, \ldots, \mu_n$ be the sequence of markings described in Theorem 2.2 such that
$\eta_1 = \mu_1$ is a short marking on $\sigma_1$ and $\eta_2 = \mu_n$ is a short marking on $\sigma_2$. Note that
the condition that $\sigma_1$ and $\sigma_2$ are in the $\epsilon_0$-thick part of Teichmüller space implies
that the length of $\eta_i$ in $\sigma_i$ is bounded independent of $t$.
For an elementary move on a marking, let the associated base curve be either the
twisting curve, if the move is a twist, or the (initial) base curve in the flipped pair, if
the move is a flip. For the sequence of markings $\{\mu_i\}$, let $\alpha_i$ be the associated base
curve for the move $\mu_i \to \mu_{i+1}$ and let $\gamma_i$ be any base curve of the marking $\mu_i$ which
is different from $\alpha_i$ (the complexity condition $3g + p > 4$ implies that there are at
least two base curves in every marking). Note that $\gamma_i$ and $\gamma_{i+1}$ have intersection
number zero. For any marking $\mu_i$ and $M$ sufficiently large, the set

\[ B_i = \{\tau \in \mathcal{T}(S) : l_\tau(\mu_i) \le M\} \subset \mathcal{T}(S) \]
is nonempty and has bounded diameter (this follows, again, from the Wolpert
formula). Fix such an $M$, which is suppressed in the notation $B_i$ from here on.

Since we can assume $t$ is sufficiently large, we will proceed taking $t \ge \max_i\{\operatorname{diam} B_i\}$
and $t \ge \log 1/\epsilon_1$, where $\epsilon_1$ is the injectivity radius of $\sigma$.

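Let $\tau^1 = \sigma_1$, $\tau^n = \sigma_2$ and, for $1 < i < n$, choose $\tau^i$ to be a point in $B_i$.

Let $\tau^i_{\gamma_i}$ be the point of $\mathcal{T}(S)$ that has the same Fenchel-Nielsen coordinates as
$\tau^i$ except that $\gamma_i$ (which has bounded length in $\tau^i$) is pinched to have length equal
to $\epsilon$. Let $\tau^i_{\gamma_{i-1}}$ and $\tau^i_{\gamma_{i-1}\gamma_i}$ be defined similarly. Consider the piecewise geodesic path
$P$ in $\mathcal{T}(S)$ defined by connecting up the points in the sequence below:

\[ \sigma_1 = \tau^1 \xrightarrow{p_1} \tau^1_{\gamma_1} \xrightarrow{e_1} \tau^2_{\gamma_1} \xrightarrow{p_2} \tau^2_{\gamma_1\gamma_2} \xrightarrow{\bar p_1} \tau^2_{\gamma_2} \xrightarrow{e_2} \tau^3_{\gamma_2} \xrightarrow{p_3} \tau^3_{\gamma_2\gamma_3} \xrightarrow{\bar p_2} \tau^3_{\gamma_3} \cdots \]

\[ \cdots\ \tau^{n-1}_{\gamma_{n-2}\gamma_{n-1}} \xrightarrow{\bar p_{n-2}} \tau^{n-1}_{\gamma_{n-1}} \xrightarrow{e_{n-1}} \tau^{n}_{\gamma_{n-1}} \xrightarrow{\bar p_{n-1}} \tau^n = \sigma_2. \]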
There are three kinds of steps:

($p_i$) pinches the next curve, $\gamma_i$, by shortening it to length $\epsilon$;

($e_i$) applies an elementary move while keeping the curve $\gamma_i$ short;

($\bar p_i$) releases the previous curve, $\gamma_i$, by restoring it to its length before pinching.

We may think of $p_i$, $\bar p_i$ and $e_i$ as paths in $\mathcal{T}(S)$ which are subpaths of $P$. We
denote the lengths of these paths by $|p_i|$, $|\bar p_i|$ and $|e_i|$ respectively. We will bound
the lengths of these subpaths and show that they stay outside $B_t(\sigma)$ in order to
complete the proof of Theorem A for $\mathcal{T}(S)$.

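Except along $p_1$ and $\bar p_{n-1}$, at every point in the path $P$, at least one curve has
length equal to $\epsilon$; therefore, by the choice of $\epsilon$, these points are outside $B_t(\sigma)$. The
lengths of the subpaths $p_1$ and $\bar p_{n-1}$ are (up to an additive error) equal to $t$ and they
have one point at distance $3t$ from $\sigma$; therefore these subpaths are also outside $B_t(\sigma)$.

We will estimate the lengths of the $p_i$ and $\bar p_i$ by showing that

\[ d_{\mathcal{T}}(\tau^i, \tau^i_{\gamma_i}) \asymp d_{\mathcal{T}}(\tau^i, \tau^i_{\gamma_{i-1}}) \asymp t \]
and
\[ d_{\mathcal{T}}(\tau^i_{\gamma_{i-1}}, \tau^i_{\gamma_{i-1}\gamma_i}) \asymp d_{\mathcal{T}}(\tau^i_{\gamma_{i-1}\gamma_i}, \tau^i_{\gamma_i}) \asymp t. \]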

To estimate the distance from $\tau^i$ to $\tau^i_{\gamma_i}$, consider first pinching the curve $\gamma_i$ in $\tau^i$
to obtain a new metric $\tau \in \mathcal{T}(S)$, where $\gamma_i$ has length $\epsilon_0$. The path from $\tau^i$ to $\tau$
has bounded length because both $\tau^i$ and $\tau$ are in $B_i$, which has bounded diameter.
Now letting $\Gamma = \{\gamma_i\}$, we note that $\tau$, $\tau^i_{\gamma_i}$ are both in $\mathrm{Thin}_{\epsilon_0}(\Gamma)$ and Theorem 2.3
applies. But their projections to $S \setminus \Gamma$ are identical. Therefore, their distance, up to
an additive error, is equal to the distance in $\mathbb{H}_{\gamma_i}$ between $\pi_{\gamma_i}(\tau^i_{\gamma_i})$ and $\pi_{\gamma_i}(\tau)$, their
projections to that factor. The points $\tau$ and $\tau^i_{\gamma_i}$ have the same twisting parameters
around $\gamma_i$ (the $x$-coordinates of their projections to $\mathbb{H}_{\gamma_i}$ are the same), but different
length parameters (the $y$-coordinates are $1/\epsilon_0$ and $1/\epsilon$ respectively). Therefore

\[ d_{\mathcal{T}}(\tau^i_{\gamma_i}, \tau) \overset{+}{\asymp} d_{\mathbb{H}_{\gamma_i}}\bigl(\pi_{\gamma_i}(\tau^i_{\gamma_i}), \pi_{\gamma_i}(\tau)\bigr) = \frac{1}{2}\log\frac{\epsilon_0}{\epsilon} \asymp t. \]
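Next, we estimate the lengths of the $e_i$ by showing that

\[ |e_i| \prec \begin{cases} 1 & \text{if } \mu_i \text{ and } \mu_{i+1} \text{ differ by an elementary move}, \\ \log p & \text{if } \mu_{i_\alpha+1} = D_\alpha^{p}\mu_{i_\alpha},\ |p| \overset{+}{\asymp} d_\alpha(\eta_1,\eta_2) \ge K. \end{cases} \]
If the elementary move from $\mu_i$ to $\mu_{i+1}$ is a simple twist or flip, then the length
of the segment ($e_i$) is bounded. This is because $\pi_0(\tau^i_{\gamma_i})$ and $\pi_0(\tau^{i+1}_{\gamma_i})$ are within
bounded distance, and therefore

\[ d_{\mathcal{T}}(\tau^i_{\gamma_i}, \tau^{i+1}_{\gamma_i}) \prec 1. \]

But there can also be high powers of twists: for every $\alpha$ where $d_\alpha(\mu_1,\mu_n)$ is large
there is an index $i_\alpha$ where $\mu_{i_\alpha+1} = D_\alpha^{p}\mu_{i_\alpha}$, as in Theorem 2.2. Here the length of
($e_{i_\alpha}$) is on the order of $\log d_\alpha(\mu_1,\mu_n)$. Therefore the total length of $P$ is

\[ \sum_i \bigl(|p_i| + |\bar p_i| + |e_i|\bigr) \asymp nt + \sum_\alpha \log\bigl[d_\alpha(\mu_1,\mu_n)\bigr]_K. \]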

But Theorem 12.21 and Theorem 12.41 imply 

[dy(/ii,^f„)]^ < dr(o-i,o-2) < 6t . 

YCS 

It follows that the length |P| is at most on the order of 

(6t) t + 6t X t"^. 

This tells us in particular that the elementary moves (which move within a single 
product region) contribute negligibly to the total length of the path when the 
quadratic rate is realized; the pinch-and-release steps (which pivot from one product 
region to the next) account for the whole length of the path, asymptotically. 
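
The arithmetic behind the quadratic bound can be collected in one line. The sketch below rests on three assumptions that are not spelled out at this point of the text: that the number of steps $n$ is itself bounded by a multiple of $6t$ (as the factor $(6t)\,t$ suggests), that $\log x \le x$ is applied to every term of size at least $K \ge 1$, and that terms below the threshold contribute nothing to the sum.

```latex
\[
  |P| \;\prec\; n\,t + \sum_{\alpha} \log\,[d_\alpha(\mu_1,\mu_n)]_K
      \;\le\; n\,t + \sum_{Y \subseteq S} [d_Y(\mu_1,\mu_n)]_K
      \;\prec\; (6t)\,t + 6t \;\asymp\; t^2 .
\]
```

Read this way, the first summand (the pinch-and-release steps) is quadratic in $t$, while the second (the elementary moves) is only linear.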

Above, we assumed that the starting and ending points of the path were in the 
$\epsilon_0$-thick part. This condition was used in the definition of $\tau^i$ (for example we have 
assumed that $\sigma_1 \in B_1$). This assumption is not always true. However, we can 
modify the beginning and the end of the path above to accommodate the general 
case as follows: 

Let $\tau$ be a point in $\mathcal{T}(S)$ with the same Fenchel-Nielsen coordinates as $\sigma_1$ except 
that the length of $\gamma_1$ is $\epsilon$. Let $\tau^1_{\gamma_1}$ be the point obtained by increasing the lengths 
of other short curves (lengths less than $\epsilon_0$) to a moderate length. We will show 
that these paths $P_1 = [\sigma_1, \tau]$ and $P_2 = [\tau, \tau^1_{\gamma_1}]$ are outside $B_t(\sigma)$ and have length of 
order $t$; the rest of the calculation reverts to the arguments above. 

Note that there is a lower bound on the $\sigma_1$-lengths of all curves. That is, again 
using [W2] and recalling that $\epsilon_1$ is the injectivity radius at $\sigma$, 






Since there is no twisting or other change in the marking, it follows from Theorem 2.4 
that both of the paths $P_1$ and $P_2$ have length at most on the order of $t$ (the only 
contribution to distance comes from the length ratios). The path $P_2$ stays outside 
$B_t(\sigma)$ because along this path the curve $\gamma_1$ has length $\epsilon$. To see that the path $P_1$ 
is outside $B_t(\sigma)$ we take two cases. If the length of $\gamma_1$ in $\sigma_1$ is less than $\epsilon$, then 
it remains less than $\epsilon$ along this path and we are done. If $\epsilon_0 > \ell_{\sigma_1}(\gamma_1) \ge \epsilon$, then 
the length of this path is no more than $\tfrac{1}{2}\log\frac{\epsilon_0}{\epsilon} \stackrel{+}{\asymp} t$ by the product regions theorem 
(Theorem 2.3). But $d_{\mathcal{T}}(\sigma,\sigma_1) = 3t$, which completes the argument. 

Applying the same modifications to the end of path P, we obtain the desired 
result for the general case. 

Example realizing the quadratic rate. To see that the quadratic estimate is 
sharp, we must furnish an example of a pair of rays whose divergence rate is exactly 
quadratic. We will use a quasi-projection result of Minsky [Mi2] which shows that 
intervals which are far (relative to their length) from a cobounded geodesic (segment, 
ray, or line) project to sets whose diameter is uniformly bounded above. Recall that 
an $\epsilon$-cobounded geodesic in $\mathcal{T}(S)$ is one which stays in the $\epsilon$-thick part. (Note 
that part of the proof entails that Proj, the closest-point projection, is coarsely 
well-defined in the following setting.) 

Theorem 3.1 (Quasi-projection for $\mathcal{T}(S)$ [Mi2]). For every $\epsilon > 0$ there are 
constants $b_1$, $b_2$ depending on $\epsilon$ and the topology of $S$ such that the following holds. 
Let $G$ be an $\epsilon$-cobounded geodesic in $\mathcal{T}(S)$, suppose $\tau \in \mathcal{T}(S)$ satisfies $d(\tau, G) > b_1$, 
and let $r = d(\tau, G) - b_1$. Then 

\[ \mathrm{diam}\bigl(\mathrm{Proj}_G(B_r(\tau))\bigr) \le b_2. \]

It is straightforward to replace the geodesic in the statement of the theorem with 
a quasi-geodesic. Also note that it suffices to check a bounded geometry condition 
for the endpoints of the geodesic (in $\mathcal{T}(S)$ or on the Thurston boundary) to ensure 
that it stays in the thick part [R2]. 

To apply this theorem, we may for instance choose $q$ and $q' = -q$ to point in 
opposite directions along the axis of a pseudo-Anosov mapping class. (Coboundedness 
is guaranteed because pseudo-Anosov axes project to closed curves in moduli 
space.) 

Proposition 3.2. For any $\epsilon$-cobounded geodesic $G$ in $\mathcal{T}(S)$ and any point $\sigma \in G$, 
let $\sigma_t$ and $\sigma'_t$ be the points on $G$ at distance $t$ from $\sigma$. Then for any path $P$ in $\mathcal{T}(S)$ 
from $\sigma_t$ to $\sigma'_t$ which maintains a distance at least $t$ from $\sigma$, 

\[ |P| \succ t^2. \]

Proof. Take $G_0$ to be the subsegment of $G$ of length $t$, centered at $\sigma$. Then any path 
$P$ between $\sigma_t$ and $\sigma'_t$ outside $B_t(\sigma)$ maintains a distance at least $t/2$ from $G_0$ at all 
times. 

Now if $P$ is a path connecting $\sigma_t$ and $\sigma'_t$ in $\mathcal{T}(S)$ of length $|P|$, we can divide it 
into pieces of length $t/2$, taking $P = I_1 \cup \cdots \cup I_n$ for successive pieces of length $t/2$ 
(with $I_n$ possibly shorter). The number of these intervals, $n$, satisfies 

\[ 2|P| \le nt < 2|P| + t. \]

Each $I_i$ maintains a distance at least $t/2$ from $G_0$. Since $|I_i| \le t/2$ and $t$ is 
large, $I_i$ is covered by two balls of radius $t/2 - b_1$, centered on the endpoints of the 
interval. Thus Minsky's quasi-projection theorem assures that $|\mathrm{Proj}_{G_0}(I_i)| \le 2b_2$ 
for each $i$. If $x_{i-1}$ and $x_i$ are closest-point projections to $G_0$ of the endpoints 
of $I_i$, we see that $d(x_{i-1}, x_i) \le 2b_2$. But then $t = d(x_0, x_n) \le 2nb_2$, and 
since $nt \asymp |P|$, we obtain $|P| \succ t^2$. □ 
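
The counting in this proof can be turned into an explicit, if crude, lower bound. The following Python sketch is only an illustration: the function name `min_path_length` and the numerical values passed for `b2` are assumptions made here, not part of the source, and the additive errors hidden in the coarse projection are ignored.

```python
import math


def min_path_length(t: float, b2: float) -> float:
    """Lower bound on |P| from the counting argument (illustrative only)."""
    # Each piece of length at most t/2 projects to a set of diameter at most
    # 2*b2, and the projections must span distance t, so n >= t / (2*b2).
    n_min = math.ceil(t / (2.0 * b2))
    # n pieces of length t/2 (the last possibly shorter) force |P| > (n-1)*t/2.
    return max(0.0, (n_min - 1) * t / 2.0)


if __name__ == "__main__":
    for t in (100.0, 200.0, 400.0):
        print(t, min_path_length(t, b2=1.0))
```

For fixed `b2` the returned value grows like $t^2/(4b_2)$, which is the quadratic rate asserted in Proposition 3.2.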



4 The Mapping Class Group 

Constructing a path for the upper bound. Consider two infinite distinct 
geodesic rays, 

\[ \phi : \mathbb{N} \to \mathrm{Mod}(S) \quad\text{and}\quad \psi : \mathbb{N} \to \mathrm{Mod}(S), \]
in Mod(S) emanating from a common point (without loss of generality, the identity 
element) in Mod(S). We want to show that, for all $t \in \mathbb{N}$, there is a path from 
$\phi(t)$ to $\psi(t)$ in Mod(S), such that no point in this path is within distance $t$ of the 
origin, and whose length is on the order of $t^2$. We construct this path by traveling 
iteratively through chained copies of $\mathbb{Z}^2$, each copy generated by Dehn twists about a 
pair of disjoint curves. However, first we need to move far enough from the identity, 
to points $\phi(dt)$ and $\psi(dt)$, so that the first few steps of the sequence are sure not to 
backtrack near the identity. This is accomplished by taking $d$ from Lemma 2.11. 

Let $B_t = B_t(e)$ be the ball of radius $t$ in Mod(S) about the identity. The segments 
$[\phi(t), \phi(dt)]$ and $[\psi(t), \psi(dt)]$ stay outside $B_t$ and their lengths are of order $t$. To 
prove Theorem A for Mod(S), it is sufficient to build a path $P$ between the two 
points $u = \phi(dt)$ and $v = \psi(dt)$ that stays outside $B_t$ and has length on the order 
of $t^2$. 

The path from $u$ to $v$ will involve high powers of Dehn twists arranged in "switch 
moves" from one twist flat to the next. We fix the exponent $m = m(t)$ to be larger 
than $(3dc + c)t$ (but of order $t$). 

Lemma 4.1 (Switch moves). For any two curves $\alpha, \beta \in A$, any $w \in$ Mod(S) with 
$|w| \le 3dt$, and $m$ chosen as above, there is a path (★) from $D_\alpha^m w$ to $D_\beta^m w$ staying 
outside $B_t$ and of length $\prec t$. 

Proof. For the curves $\alpha, \beta$, we fix a chain of curves 

\[ \{\gamma_i^{\alpha\beta}\} = \{\gamma_1^{\alpha\beta}, \ldots, \gamma_k^{\alpha\beta}\}, \quad\text{with } k = k(\alpha,\beta), \tag{4} \]
having the property that 

\[ \alpha - \gamma_1^{\alpha\beta} - \cdots - \gamma_k^{\alpha\beta} - \beta \]

is a path in the curve complex, that is, each adjacent pair of curves is disjoint on $S$. 
When there is no ambiguity about $\alpha$ and $\beta$, we denote these curves simply by $\{\gamma_i\}$. 
Define (★) to be the following path from $D_\alpha^m w$ to $D_\beta^m w$: 

\[ D_\alpha^m w \to D_{\gamma_1}^m D_\alpha^m w \to D_{\gamma_1}^m w \to D_{\gamma_2}^m D_{\gamma_1}^m w \to \cdots \to D_\beta^m D_{\gamma_k}^m w \to D_\beta^m w. \tag{$\star$} \]

Every word visited by this path has the form $D_{\gamma_i}^{m'} D_{\gamma_j}^{m} w$ for some $0 \le m' \le m$ 
and by Lemma 2.10 has word length at least $m/c - 3dt > t$. Therefore, the subpath 
(★) stays outside $B_t$. Also, an upper bound for $k = k(\alpha,\beta)$ depends on the choice 
of $A$ only. Therefore the length of each subpath (★) (which is $2km$) is of order $t$. □ 
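
The inequality $m/c - 3dt > t$ used in this proof is a direct consequence of the choice of the exponent $m$. Spelled out, assuming only that $c > 0$:

```latex
\[
  m > (3dc + c)\,t
  \quad\Longrightarrow\quad
  \frac{m}{c} > 3dt + t
  \quad\Longrightarrow\quad
  \frac{m}{c} - 3dt > t .
\]
```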

To illustrate how the switch moves work we consider the following example. 
Suppose that $\alpha - \gamma_1 - \gamma_2 - \beta$ is a path in the curve complex from $\alpha$ to $\beta$ and denote 
the associated Dehn twists by $a$, $g$, $h$ and $b$, respectively. Note that $\langle a,g \rangle$, $\langle g,h \rangle$ and 
$\langle h,b \rangle$ are free abelian subgroups of Mod(S). Let $w$ be the word 

\[ w = (h^{-m} b^{m})(g^{-m} h^{m})(a^{-m} g^{m}). \]

After cancellation, $w$ is equivalent to $b^{m}a^{-m}$. Assuming that $a$, $g$, $h$ and $b$ are all 
in the generating set of Mod(S), we can also consider $w$ as a path along the edges of 
the Cayley graph of Mod(S) connecting $a^{m}$ to $b^{m}$ (first follow edges marked by the 
generator $g$ for $m$ steps, then follow edges marked $a^{-1}$ for $m$ steps, and so on). Note 
that, in the process of carrying out the word, we stay far from the identity in the 
Cayley graph: for any decomposition $w = vw'$, the word $w'a^{m}$ contains a high power 
of at least one of $a$, $g$, $h$ or $b$, so its word-length is large. This path starts from $a^{m}$, 
travels through the quasi-flat $\langle a,g \rangle$ to $g^{m}$, through $\langle g,h \rangle$ to $h^{m}$ and through $\langle h,b \rangle$ 
to $b^{m}$. (On the other hand, the shorter path connecting $a^{m}$ to $b^{m}$ corresponding to 
the word $b^{m}a^{-m}$ would go through the identity.) 
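
The route through the twist flats in this example can be traced step by step in a toy model. The Python sketch below is an illustration under explicit assumptions: the function `switch_path` is a name introduced here, each vertex is recorded only by its exponents in the current flat (which is legitimate because every vertex on this particular path lies in one of the abelian subgroups generated by two adjacent twists), and the largest exponent is used as a stand-in for "contains a high power of some twist". It makes no claim about actual word length in Mod(S), which the text controls with Lemma 2.10.

```python
def switch_path(m, chain=("a", "g", "h", "b")):
    """List the vertices visited when moving a^m -> g^m -> h^m -> b^m.

    Each vertex is a dict {generator: exponent}. Toy bookkeeping only.
    """
    point = {chain[0]: m}
    path = [dict(point)]
    for old, new in zip(chain, chain[1:]):
        # Raise the new twist to the power m while the old one stays at m.
        for k in range(1, m + 1):
            point[new] = k
            path.append(dict(point))
        # Then undo the old twist one edge at a time.
        for k in range(m - 1, -1, -1):
            if k:
                point[old] = k
            else:
                del point[old]
            path.append(dict(point))
    return path


if __name__ == "__main__":
    path = switch_path(5)
    print("edges:", len(path) - 1)
    print("smallest top exponent:", min(max(p.values()) for p in path))
```

Every vertex of the traced path carries some twist to the full power $m$, whereas the direct route $a^{m} \to e \to b^{m}$ would pass through a vertex with no twist at all.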




Figure 3: The path in the Cayley graph pushes off from the ball $B_t$ and moves 
through a chain of Dehn twist flats in Mod(S). 



Now we will build a path $P$ from $u$ to $v$ using these switch moves. Let $n = |uv^{-1}|$ 
and write $uv^{-1} = g_n \cdots g_2 g_1$, where $g_i = D_{\alpha_i}^{\pm 1}$, $\alpha_i \in A$, is a generator. For $0 \le r \le n$, 
let $v_r = g_r \cdots g_1 v$, so that $v_0 = v$ and $v_n = u$. 

\[ v \to g_1^{\pm m} v \to g_1^{\pm m} v_1 \xrightarrow{(\star)} g_2^{m} v_1 \to g_2^{m} v_2 \xrightarrow{(\star)} g_3^{m} v_2 \to \cdots \xrightarrow{(\star)} g_n^{\pm m} v_{n-1} \to g_n^{\pm m} v_n \to v_n. \]

The signs $\pm$ are chosen by Lemma 2.11, which implies that the segments $[v, g_1^{m} v]$ 
and $[v, g_1^{-m} v]$ do not both intersect $B_t$. We choose the segment for the beginning 
of $P$ that is disjoint from $B_t$. Similarly, the sign for the power at the last step is 
chosen so that the path from $v_n$ to $g_n^{\pm m} v_n$ is outside $B_t$. Note that $n \le 2dt$ and 
$|v_i| \le |v| + n \le 3dt$; therefore, by Lemma 2.8 and the assumption on $m$, 

\[ d_{\alpha_j}(g_j^{\pm m} v_i) \ge m - 3dct > ct. \]
By Theorem 2.6 we get $|g_j^{\pm m} v_i| > t$, which confirms that all of $P$ stays outside $B_t$. 

We have shown that the length of path $P$, which contains $n - 1$ "switch move" 
subpaths, is of order $t^2$. This finishes the proof of Theorem A for Mod(S). 
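
The length count for $P$ can be tabulated in a few lines. The Python sketch below is a rough accounting under stated assumptions, not the authors' computation: the function name `switch_path_length` is introduced here, the constants `d`, `c` and `k` are treated as given positive integers, `n` is taken at its largest allowed value $2dt$, each of the two end segments is charged `m` edges, each step from $v_{r-1}$ to $v_r$ is charged one edge, and each of the `n - 1` switch moves is charged the length $2km$ quoted in Lemma 4.1.

```python
import math


def switch_path_length(t: int, d: int, c: int, k: int) -> int:
    """Edge count of the concatenated path P in a toy accounting."""
    m = math.floor((3 * d * c + c) * t) + 1  # an integer exceeding (3dc + c) t
    n = 2 * d * t                            # worst case allowed by n <= 2dt
    ends = 2 * m                             # first and last power segments
    single_steps = n                         # one edge per step v_{r-1} -> v_r
    switches = (n - 1) * 2 * k * m           # n - 1 switch moves of length 2km
    return ends + single_steps + switches


if __name__ == "__main__":
    for t in (10, 20, 40):
        print(t, switch_path_length(t, d=2, c=3, k=4))
```

Doubling `t` roughly quadruples the count, since both the number of switch moves and the length of each one are linear in $t$.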

Example realizing the quadratic rate. We now prove Theorem B for the case 
of Mod(S) by first establishing a quasi-projection theorem. Note that closest-point 
projection is not in general well-behaved in the mapping class group itself, so the theorem 
is stated in terms of projection in the curve complex. We map from Mod(S) to 
$\mathcal{C}(S)$ by the coarse map $g \mapsto gA$. Note that the word-length formula (Theorem 2.6) 
ensures that this map is coarsely Lipschitz. 

Distance contraction results for the mapping class group with arguments based 
on subsurface projection appear with various formulations in the literature (including 
Masur-Minsky, Behrstock, and forthcoming work of Brock-Masur-Minsky). A 
statement is given and proved here in the generality which we will require, providing 
a strong parallel with Theorem 3.1. 

A geodesic line, ray, or segment $G$ in Mod(S) is called $E$-cobounded if for every 
proper subsurface $Y \subset S$ and for every two elements $a, b \in G$, we have $d_Y(aA, bA) \le E$. 

Theorem 4.2 (Quasi-projection for Mod(S)). For every $E > 0$ there exist constants 
$B_1$ and $B_2$ depending on $E$ and the topology of $S$ such that the following 
holds. Let $G \subset$ Mod(S) be an $E$-cobounded geodesic, suppose $g \in$ Mod(S) satisfies 
$d(g, G) > B_1$, and let $R = d(g, G)/B_1$. Then $\mathcal{G} = GA$ is a quasi-geodesic in $\mathcal{C}(S)$ 
and 

\[ \mathrm{diam}\bigl(\mathrm{Proj}_{\mathcal{G}}(B_R(g)A)\bigr) \le B_2. \]

As before, the theorem could be stated for quasi-geodesics with the same argument. 
Note that for quasi-projection to a segment, it suffices that the endpoints 
$a, b \in$ Mod(S) satisfy $d_Y(aA, bA) \le E$ in order for the conclusion to obtain. 

For a concrete application, consider a pseudo-Anosov element $w$ and its axis 
in Mod(S). These axes are known to be cobounded by work of Masur-Minsky. 
We then apply the quasi-projection theorem exactly as above to show that, for 
a high power $m$, any path $P$ connecting $w^{-m}$ to $w^{m}$ outside the ball of radius 
$t := |w^{m}| \asymp |m|$ must have length at least on the order of $t^2$. (Since projection to the 
curve complex coarsely contracts distances, this agrees with the desired inequality.) 

The rest of this section is devoted to proving Theorem 4.2. The constants in the 
rest of this section will depend on the choice of $E$, which is chosen once and for all. 
Keeping this in mind we can write, for example, 

ds{A,g)>:l. 
Proof of Theorem 4.2. Let $h \in B_R(g)$. Let $\alpha = g(A)$ and $\beta = h(A)$ (these are sets 
of curves, and they fill $S$). Let $\bar\alpha$ and $\bar\beta$ be closest-point projections of $\alpha$ and $\beta$ to $\mathcal{G}$, 
respectively. Choose $\bar g, \bar h \in G$ so that 

\[ \bar\alpha = \bar g(A) \quad\text{and}\quad \bar\beta = \bar h(A), \]

which is possible because $\mathcal{G} = GA$. Our goal is to show that $d_S(\bar\alpha, \bar\beta) \le B_2$, so 
assume for contradiction that 

\[ d_S(\bar\alpha, \bar\beta) > B_2. \]





Figure 4: We assume for contradiction that the projections of $\alpha$ and $\beta$ to $\mathcal{G}$ 
are far apart in $\mathcal{C}(S)$. 



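Given any $K$, $c$ as in Theorem 2.6 we can define $\mathcal{Y}$ to be the set of proper 
subsurfaces $Y \subset S$ with $d_Y(\alpha, \bar\alpha) > K$. By the theorem's hypothesis, we have 
$|\bar g^{-1} g| = d(g, \bar g) \ge d(g, G) = B_1 R$ and so 

\[ \sum_{Y \in \mathcal{Y}} d_Y(\alpha, \bar\alpha) + d_S(\alpha, \bar\alpha) \ge \frac{|\bar g^{-1} g|}{c} \ge \frac{B_1 R}{c}. \]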

We will proceed in two cases. First assume that 

\[ d_S(\alpha, \bar\alpha) \ge \frac{B_1 R}{2c}. \tag{5} \]

If $B_2 > M_1$, then any geodesic connecting $\alpha$ to $\beta$ passes through the $M_2$-neighborhood 
of $\bar\alpha$, by Lemma 2.12. Hence for large enough $B_1$, we have 

\[ d_S(\alpha, \beta) \ge d_S(\alpha, \bar\alpha) - M_2 \ge \frac{B_1 R}{2c} - M_2 \ge \frac{B_1 R}{3c}. \]

Enlarging $B_1$ again if necessary, we can assume $d_S(\alpha, \beta) \ge K_0$. Theorem 2.6 implies 

\[ |h^{-1} g| \ge \frac{d_S(\alpha, \beta)}{c_0} \ge \frac{B_1 R}{3cc_0} > R, \]

for $B_1$ sufficiently large. But this contradicts the assumption that $d(g, h) < R$. 
In the second case, the assumption of Equation (5) is not true, so we have 

\[ \sum_{Y \in \mathcal{Y}} d_Y(\alpha, \bar\alpha) \ge \frac{B_1 R}{2c}, \tag{6} \]

where again $\mathcal{Y}$ is chosen relative to the threshold $K$. 

Next we show that the subsurfaces with large projection distance between $\alpha$ and 
$\bar\alpha$ do not also have a large distance between $\beta$ and $\bar\beta$. 

Claim. For every $Y \in \mathcal{Y}$, $d_Y(\beta, \bar\beta) \le M_0$. 

Proof. If $K > M_0$, then Theorem 2.1 implies that any geodesic $[\alpha, \bar\alpha]$ intersects the 
1-neighborhood of $\partial Y$ for every $Y \in \mathcal{Y}$. Applying Lemma 2.13 and Lemma 2.14 we 
can conclude that 

\[ d_S(\bar\alpha, \mathrm{Proj}(\partial Y)) \le M_3 + 2M_4. \]

If $d_Y(\beta, \bar\beta) > M_0$ for any particular $Y \in \mathcal{Y}$, we similarly get 

\[ d_S(\bar\beta, \mathrm{Proj}(\partial Y)) \le M_3 + 2M_4. \]

But then 

\[ d_S(\bar\alpha, \bar\beta) \le 2M_3 + 4M_4. \]

For $B_2$ larger than this final constant, this is a contradiction. □ 

Note that $\bar g, \bar h \in G$ and $G$ is an $E$-cobounded geodesic. Therefore, for every $Y \in \mathcal{Y}$, 

\[ d_Y(\bar\alpha, \bar\beta) = d_Y(\bar g(A), \bar h(A)) \le E. \]

Using the triangle inequality for the projection distance (Lemma 2.7) we get 

\[ d_Y(\alpha, \beta) \succ d_Y(\alpha, \bar\alpha) - d_Y(\bar\alpha, \bar\beta) - d_Y(\bar\beta, \beta) \ge d_Y(\alpha, \bar\alpha) - M_0 - E. \]
By choosing $K$ large enough, we can ensure for $Y \in \mathcal{Y}$ that $d_Y(\alpha, \beta)$ is much 
larger than these additive errors in order to get a multiplicatively coarse equality 

\[ d_Y(\alpha, \beta) \asymp d_Y(\alpha, \bar\alpha) \]

as well as ensuring that 

\[ d_Y(\alpha, \beta) \ge K_0. \]

We apply these, as well as Theorem 2.6 one last time, to see that 

\[ c_0 |h^{-1} g| \succ \sum_{Y \in \mathcal{Y}} [d_Y(g(A), h(A))]_{K_0} \asymp \sum_{Y \in \mathcal{Y}} d_Y(\alpha, \bar\alpha) \ge \frac{B_1 R}{2c}, \]

where the final inequality comes from (6). We have shown that 

\[ |h^{-1} g| \succ \frac{B_1 R}{2cc_0}. \]

Again, choosing $B_1$ large enough provides the contradiction. □ 